Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 12651 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 22 |
| Duplicate rows (%) | 0.2% |
| Total size in memory | 1.8 MiB |
| Average record size in memory | 147.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 10 |
| Dataset has 22 (0.2%) duplicate rows | Duplicates |
Bilirubin is highly overall correlated with Copper | High correlation |
Copper is highly overall correlated with Bilirubin | High correlation |
Edema_N is highly overall correlated with Edema_S and 1 other fields | High correlation |
Edema_S is highly overall correlated with Edema_N | High correlation |
Edema_Y is highly overall correlated with Edema_N | High correlation |
is_male is highly imbalanced (79.9%) | Imbalance |
Ascites is highly imbalanced (92.5%) | Imbalance |
Edema_N is highly imbalanced (79.9%) | Imbalance |
Edema_S is highly imbalanced (85.9%) | Imbalance |
Edema_Y is highly imbalanced (91.0%) | Imbalance |
Status is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2024-01-03 08:06:16.113987 |
|---|---|
| Analysis finished | 2024-01-03 08:06:29.884559 |
| Duration | 13.77 seconds |
| Software version | ydata-profiling vv4.6.3 |
| Download configuration | config.json |
N_years
Real number (ℝ)
| Distinct | 6524 |
|---|---|
| Distinct (%) | 51.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.2591544 |
| Minimum | 0.11232877 |
|---|---|
| Maximum | 13.136986 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 0.11232877 |
|---|---|
| 5-th percentile | 1.9405427 |
| Q1 | 3.3808219 |
| median | 4.8254377 |
| Q3 | 6.7616438 |
| 95-th percentile | 10.407264 |
| Maximum | 13.136986 |
| Range | 13.024658 |
| Interquartile range (IQR) | 3.3808219 |
Descriptive statistics
| Standard deviation | 2.5687181 |
|---|---|
| Coefficient of variation (CV) | 0.48842797 |
| Kurtosis | -0.11970502 |
| Mean | 5.2591544 |
| Median Absolute Deviation (MAD) | 1.7042858 |
| Skewness | 0.62660198 |
| Sum | 66533.562 |
| Variance | 6.5983127 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.761643836 | 154 | 1.2% |
| 2.468493151 | 107 | 0.8% |
| 6.139726027 | 100 | 0.8% |
| 3.331506849 | 93 | 0.7% |
| 2.106849315 | 86 | 0.7% |
| 2.476712329 | 83 | 0.7% |
| 3.928767123 | 68 | 0.5% |
| 9.438356164 | 67 | 0.5% |
| 2.969863014 | 65 | 0.5% |
| 6.780821918 | 62 | 0.5% |
| Other values (6514) | 11766 |
| Value | Count | Frequency (%) |
| 0.1123287671 | 2 | < 0.1% |
| 0.1397260274 | 5 | < 0.1% |
| 0.1945205479 | 2 | < 0.1% |
| 0.2109589041 | 1 | < 0.1% |
| 0.2565826507 | 1 | < 0.1% |
| 0.2933346337 | 1 | < 0.1% |
| 0.301369863 | 17 | |
| 0.3358356974 | 1 | < 0.1% |
| 0.3429842032 | 1 | < 0.1% |
| 0.3502873616 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 13.1369863 | 6 | < 0.1% |
| 12.8167842 | 1 | < 0.1% |
| 12.48219178 | 23 | |
| 12.39178082 | 7 | 0.1% |
| 12.35342466 | 16 | |
| 12.32876712 | 13 | |
| 12.29359753 | 1 | < 0.1% |
| 12.23835616 | 8 | 0.1% |
| 12.22838379 | 1 | < 0.1% |
| 12.21643836 | 12 |
Age
Real number (ℝ)
| Distinct | 6431 |
|---|---|
| Distinct (%) | 50.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.416377 |
| Minimum | 26.29589 |
|---|---|
| Maximum | 78.493151 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 26.29589 |
|---|---|
| 5-th percentile | 34.085263 |
| Q1 | 41.008364 |
| median | 47.937604 |
| Q3 | 55.448014 |
| 95-th percentile | 63.673973 |
| Maximum | 78.493151 |
| Range | 52.19726 |
| Interquartile range (IQR) | 14.439651 |
Descriptive statistics
| Standard deviation | 9.2681745 |
|---|---|
| Coefficient of variation (CV) | 0.19142643 |
| Kurtosis | -0.50304814 |
| Mean | 48.416377 |
| Median Absolute Deviation (MAD) | 7.0705293 |
| Skewness | 0.26644654 |
| Sum | 612515.59 |
| Variance | 85.899058 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47.21369863 | 129 | 1.0% |
| 40.28767123 | 118 | 0.9% |
| 36.51780822 | 113 | 0.9% |
| 40.92876712 | 102 | 0.8% |
| 46.41369863 | 100 | 0.8% |
| 41.18082192 | 91 | 0.7% |
| 61.3369863 | 73 | 0.6% |
| 52.21917808 | 72 | 0.6% |
| 56.66849315 | 65 | 0.5% |
| 53.34246575 | 57 | 0.5% |
| Other values (6421) | 11731 |
| Value | Count | Frequency (%) |
| 26.29589041 | 13 | |
| 28.90410959 | 16 | |
| 29.1247559 | 1 | < 0.1% |
| 29.1593806 | 1 | < 0.1% |
| 29.23091088 | 1 | < 0.1% |
| 29.33548947 | 1 | < 0.1% |
| 29.46141509 | 1 | < 0.1% |
| 29.50726448 | 1 | < 0.1% |
| 29.57534247 | 5 | < 0.1% |
| 29.72796708 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 78.49315068 | 6 | < 0.1% |
| 77.39047862 | 1 | < 0.1% |
| 77.17238265 | 1 | < 0.1% |
| 76.86820419 | 1 | < 0.1% |
| 76.76164384 | 3 | < 0.1% |
| 76.16647186 | 1 | < 0.1% |
| 75.40712541 | 1 | < 0.1% |
| 75.0630137 | 23 | |
| 74.6407131 | 1 | < 0.1% |
| 74.57534247 | 18 |
is_male
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 741.4 KiB |
| 0.0 | |
|---|---|
| 1.0 | 396 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 37953 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 12255 | |
| 1.0 | 396 | 3.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 12255 | |
| 1.0 | 396 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 24906 | |
| . | 12651 | |
| 1 | 396 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25302 | |
| Other Punctuation | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 24906 | |
| 1 | 396 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 12651 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 37953 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 24906 | |
| . | 12651 | |
| 1 | 396 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 24906 | |
| . | 12651 | |
| 1 | 396 | 1.0% |
Ascites
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 716.7 KiB |
| 0 | |
|---|---|
| 1 | 116 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12651 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 12535 | |
| 1 | 116 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 12535 | |
| 1 | 116 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12535 | |
| 1 | 116 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12535 | |
| 1 | 116 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 12535 | |
| 1 | 116 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 12535 | |
| 1 | 116 | 0.9% |
Hepatomegaly
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 716.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12651 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 7646 | |
| 0 | 5005 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 7646 | |
| 0 | 5005 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 7646 | |
| 0 | 5005 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7646 | |
| 0 | 5005 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 7646 | |
| 0 | 5005 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 7646 | |
| 0 | 5005 |
Spiders
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 716.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12651 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 9596 | |
| 1 | 3055 | 24.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 9596 | |
| 1 | 3055 | 24.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9596 | |
| 1 | 3055 | 24.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9596 | |
| 1 | 3055 | 24.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9596 | |
| 1 | 3055 | 24.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9596 | |
| 1 | 3055 | 24.1% |
Bilirubin
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 5883 |
|---|---|
| Distinct (%) | 46.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9515906 |
| Minimum | 0.3 |
|---|---|
| Maximum | 6.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 0.84657995 |
| median | 1.4514639 |
| Q3 | 3 |
| 95-th percentile | 4.545268 |
| Maximum | 6.4 |
| Range | 6.1 |
| Interquartile range (IQR) | 2.1534201 |
Descriptive statistics
| Standard deviation | 1.3299612 |
|---|---|
| Coefficient of variation (CV) | 0.68147548 |
| Kurtosis | 0.44621523 |
| Mean | 1.9515906 |
| Median Absolute Deviation (MAD) | 0.77798416 |
| Skewness | 0.99161159 |
| Sum | 24689.573 |
| Variance | 1.7687967 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6 | 719 | 5.7% |
| 0.7 | 590 | 4.7% |
| 0.5 | 584 | 4.6% |
| 0.8 | 550 | 4.3% |
| 0.9 | 513 | 4.1% |
| 1.1 | 452 | 3.6% |
| 1.3 | 450 | 3.6% |
| 1 | 276 | 2.2% |
| 3.2 | 273 | 2.2% |
| 3.4 | 206 | 1.6% |
| Other values (5873) | 8038 |
| Value | Count | Frequency (%) |
| 0.3 | 51 | 0.4% |
| 0.3264924957 | 1 | < 0.1% |
| 0.3856655338 | 1 | < 0.1% |
| 0.3962994666 | 1 | < 0.1% |
| 0.4 | 176 | |
| 0.4056666939 | 1 | < 0.1% |
| 0.406503731 | 1 | < 0.1% |
| 0.4067682272 | 1 | < 0.1% |
| 0.4076799932 | 1 | < 0.1% |
| 0.4098887812 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6.4 | 55 | |
| 6.396264854 | 1 | < 0.1% |
| 6.383552051 | 1 | < 0.1% |
| 6.369475515 | 1 | < 0.1% |
| 6.355816062 | 1 | < 0.1% |
| 6.351186748 | 1 | < 0.1% |
| 6.335692967 | 1 | < 0.1% |
| 6.327429434 | 1 | < 0.1% |
| 6.322753681 | 1 | < 0.1% |
| 6.31603519 | 1 | < 0.1% |
Cholesterol
Real number (ℝ)
| Distinct | 5615 |
|---|---|
| Distinct (%) | 44.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 320.64242 |
| Minimum | 120 |
|---|---|
| Maximum | 588 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 120 |
|---|---|
| 5-th percentile | 205 |
| Q1 | 253 |
| median | 303 |
| Q3 | 374 |
| 95-th percentile | 495.02464 |
| Maximum | 588 |
| Range | 468 |
| Interquartile range (IQR) | 121 |
Descriptive statistics
| Standard deviation | 89.757772 |
|---|---|
| Coefficient of variation (CV) | 0.27993106 |
| Kurtosis | 0.12512306 |
| Mean | 320.64242 |
| Median Absolute Deviation (MAD) | 55.798092 |
| Skewness | 0.75961063 |
| Sum | 4056447.2 |
| Variance | 8056.4577 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 316 | 252 | 2.0% |
| 348 | 159 | 1.3% |
| 248 | 156 | 1.2% |
| 339 | 154 | 1.2% |
| 528 | 145 | 1.1% |
| 263 | 133 | 1.1% |
| 298 | 129 | 1.0% |
| 232 | 120 | 0.9% |
| 396 | 118 | 0.9% |
| 450 | 114 | 0.9% |
| Other values (5605) | 11171 |
| Value | Count | Frequency (%) |
| 120 | 8 | 0.1% |
| 127 | 30 | |
| 132 | 34 | |
| 134 | 1 | < 0.1% |
| 134.8036692 | 1 | < 0.1% |
| 140.4377783 | 1 | < 0.1% |
| 141.3352207 | 1 | < 0.1% |
| 145.7351726 | 1 | < 0.1% |
| 146.9403729 | 1 | < 0.1% |
| 147.8665235 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 588 | 1 | < 0.1% |
| 586 | 6 | < 0.1% |
| 585.925331 | 1 | < 0.1% |
| 585.2291857 | 1 | < 0.1% |
| 584.4519793 | 1 | < 0.1% |
| 584.3224204 | 1 | < 0.1% |
| 583.3434267 | 1 | < 0.1% |
| 580.9771332 | 1 | < 0.1% |
| 580.7628101 | 1 | < 0.1% |
| 578 | 40 |
Albumin
Real number (ℝ)
| Distinct | 6520 |
|---|---|
| Distinct (%) | 51.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5594897 |
| Minimum | 2.73 |
|---|---|
| Maximum | 4.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 2.73 |
|---|---|
| 5-th percentile | 3.0926727 |
| Q1 | 3.36 |
| median | 3.5684706 |
| Q3 | 3.74 |
| 95-th percentile | 4.0649078 |
| Maximum | 4.4 |
| Range | 1.67 |
| Interquartile range (IQR) | 0.38 |
Descriptive statistics
| Standard deviation | 0.28325069 |
|---|---|
| Coefficient of variation (CV) | 0.079576208 |
| Kurtosis | 0.059355912 |
| Mean | 3.5594897 |
| Median Absolute Deviation (MAD) | 0.19009797 |
| Skewness | 0.062726843 |
| Sum | 45031.104 |
| Variance | 0.080230953 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.35 | 312 | 2.5% |
| 3.6 | 283 | 2.2% |
| 3.7 | 263 | 2.1% |
| 3.85 | 215 | 1.7% |
| 3.77 | 192 | 1.5% |
| 3.5 | 171 | 1.4% |
| 3.2 | 166 | 1.3% |
| 3.65 | 154 | 1.2% |
| 3.61 | 146 | 1.2% |
| 3.18 | 138 | 1.1% |
| Other values (6510) | 10611 |
| Value | Count | Frequency (%) |
| 2.73 | 3 | < 0.1% |
| 2.74 | 5 | < 0.1% |
| 2.746479822 | 1 | < 0.1% |
| 2.75 | 45 | |
| 2.750005499 | 1 | < 0.1% |
| 2.765779037 | 1 | < 0.1% |
| 2.766645523 | 1 | < 0.1% |
| 2.768084834 | 1 | < 0.1% |
| 2.768940566 | 1 | < 0.1% |
| 2.77 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4.4 | 14 | |
| 4.381728602 | 1 | < 0.1% |
| 4.38 | 19 | |
| 4.379063205 | 1 | < 0.1% |
| 4.374894481 | 1 | < 0.1% |
| 4.372396426 | 1 | < 0.1% |
| 4.367490733 | 1 | < 0.1% |
| 4.36649819 | 1 | < 0.1% |
| 4.364298933 | 1 | < 0.1% |
| 4.362204076 | 1 | < 0.1% |
Copper
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 5598 |
|---|---|
| Distinct (%) | 44.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71.939009 |
| Minimum | 4 |
|---|---|
| Maximum | 196 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 42 |
| median | 67 |
| Q3 | 96 |
| 95-th percentile | 144.0955 |
| Maximum | 196 |
| Range | 192 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 38.326875 |
|---|---|
| Coefficient of variation (CV) | 0.53276901 |
| Kurtosis | -0.093961933 |
| Mean | 71.939009 |
| Median Absolute Deviation (MAD) | 26.162855 |
| Skewness | 0.65712891 |
| Sum | 910100.41 |
| Variance | 1468.9493 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 121 | 341 | 2.7% |
| 75 | 327 | 2.6% |
| 67 | 281 | 2.2% |
| 52 | 248 | 2.0% |
| 44 | 234 | 1.8% |
| 58 | 229 | 1.8% |
| 77 | 224 | 1.8% |
| 20 | 204 | 1.6% |
| 39 | 196 | 1.5% |
| 102 | 178 | 1.4% |
| Other values (5588) | 10189 |
| Value | Count | Frequency (%) |
| 4 | 12 | 0.1% |
| 4.420480248 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6.120768774 | 1 | < 0.1% |
| 6.648427237 | 1 | < 0.1% |
| 8.025748794 | 1 | < 0.1% |
| 8.136765134 | 1 | < 0.1% |
| 9 | 49 | |
| 9.057872445 | 1 | < 0.1% |
| 9.05983032 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 196 | 2 | < 0.1% |
| 190 | 1 | < 0.1% |
| 188 | 31 | |
| 187.8365304 | 1 | < 0.1% |
| 186.9250637 | 1 | < 0.1% |
| 186.729402 | 1 | < 0.1% |
| 186.5916134 | 1 | < 0.1% |
| 186.1774764 | 1 | < 0.1% |
| 186 | 15 | |
| 185.8676702 | 1 | < 0.1% |
Alk_Phos
Real number (ℝ)
| Distinct | 5372 |
|---|---|
| Distinct (%) | 42.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1303.6862 |
| Minimum | 289 |
|---|---|
| Maximum | 3336 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 289 |
|---|---|
| 5-th percentile | 622 |
| Q1 | 846.16726 |
| median | 1162 |
| Q3 | 1620 |
| 95-th percentile | 2520 |
| Maximum | 3336 |
| Range | 3047 |
| Interquartile range (IQR) | 773.83274 |
Descriptive statistics
| Standard deviation | 606.44671 |
|---|---|
| Coefficient of variation (CV) | 0.46517845 |
| Kurtosis | 1.4299794 |
| Mean | 1303.6862 |
| Median Absolute Deviation (MAD) | 371 |
| Skewness | 1.1997937 |
| Sum | 16492934 |
| Variance | 367777.61 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1345 | 275 | 2.2% |
| 1162 | 155 | 1.2% |
| 3336 | 147 | 1.2% |
| 938 | 135 | 1.1% |
| 2276 | 128 | 1.0% |
| 1440 | 118 | 0.9% |
| 663 | 111 | 0.9% |
| 1408 | 108 | 0.9% |
| 1136 | 107 | 0.8% |
| 1533 | 90 | 0.7% |
| Other values (5362) | 11277 |
| Value | Count | Frequency (%) |
| 289 | 44 | |
| 296.1044501 | 1 | < 0.1% |
| 302.9560298 | 1 | < 0.1% |
| 309.2518479 | 1 | < 0.1% |
| 310 | 9 | 0.1% |
| 313.6607064 | 1 | < 0.1% |
| 316.1203859 | 1 | < 0.1% |
| 319.7483787 | 1 | < 0.1% |
| 321.1275685 | 1 | < 0.1% |
| 323.655056 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3336 | 147 | |
| 3334.922441 | 1 | < 0.1% |
| 3332.733471 | 1 | < 0.1% |
| 3331.510513 | 1 | < 0.1% |
| 3331.149767 | 1 | < 0.1% |
| 3330.914915 | 1 | < 0.1% |
| 3327.097631 | 1 | < 0.1% |
| 3326.543882 | 1 | < 0.1% |
| 3325.222435 | 1 | < 0.1% |
| 3325.15926 | 1 | < 0.1% |
SGOT
Real number (ℝ)
| Distinct | 5592 |
|---|---|
| Distinct (%) | 44.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 116.60444 |
| Minimum | 26.35 |
|---|---|
| Maximum | 227.04 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 26.35 |
|---|---|
| 5-th percentile | 57.157658 |
| Q1 | 89.9 |
| median | 117.69824 |
| Q3 | 137.95 |
| 95-th percentile | 185.90894 |
| Maximum | 227.04 |
| Range | 200.69 |
| Interquartile range (IQR) | 48.05 |
Descriptive statistics
| Standard deviation | 37.681866 |
|---|---|
| Coefficient of variation (CV) | 0.32315978 |
| Kurtosis | -0.14118207 |
| Mean | 116.60444 |
| Median Absolute Deviation (MAD) | 24.698238 |
| Skewness | 0.33655685 |
| Sum | 1475162.8 |
| Variance | 1419.923 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 71.3 | 215 | 1.7% |
| 137.95 | 214 | 1.7% |
| 57.35 | 212 | 1.7% |
| 120.9 | 188 | 1.5% |
| 128.65 | 177 | 1.4% |
| 97.65 | 169 | 1.3% |
| 93 | 165 | 1.3% |
| 66.65 | 162 | 1.3% |
| 147.25 | 161 | 1.3% |
| 170.5 | 154 | 1.2% |
| Other values (5582) | 10834 |
| Value | Count | Frequency (%) |
| 26.35 | 6 | < 0.1% |
| 28.38 | 3 | < 0.1% |
| 40.6 | 1 | < 0.1% |
| 41.85 | 17 | |
| 43.00556478 | 1 | < 0.1% |
| 43.4 | 41 | |
| 43.51145398 | 1 | < 0.1% |
| 44.85377535 | 1 | < 0.1% |
| 44.91592986 | 1 | < 0.1% |
| 45 | 15 | 0.1% |
| Value | Count | Frequency (%) |
| 227.04 | 1 | < 0.1% |
| 225.1053899 | 1 | < 0.1% |
| 223.6178543 | 1 | < 0.1% |
| 222.6804644 | 1 | < 0.1% |
| 221.88 | 6 | |
| 221.768139 | 1 | < 0.1% |
| 221.65 | 3 | |
| 221.04 | 1 | < 0.1% |
| 220.8075528 | 1 | < 0.1% |
| 220.7350692 | 1 | < 0.1% |
Tryglicerides
Real number (ℝ)
| Distinct | 5673 |
|---|---|
| Distinct (%) | 44.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 106.82242 |
| Minimum | 33 |
|---|---|
| Maximum | 219 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 58 |
| Q1 | 84 |
| median | 102.6483 |
| Q3 | 128 |
| 95-th percentile | 168 |
| Maximum | 219 |
| Range | 186 |
| Interquartile range (IQR) | 44 |
Descriptive statistics
| Standard deviation | 32.473196 |
|---|---|
| Coefficient of variation (CV) | 0.30399233 |
| Kurtosis | 0.29004162 |
| Mean | 106.82242 |
| Median Absolute Deviation (MAD) | 20.351703 |
| Skewness | 0.64018675 |
| Sum | 1351410.4 |
| Variance | 1054.5085 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 84 | 313 | 2.5% |
| 146 | 274 | 2.2% |
| 118 | 256 | 2.0% |
| 91 | 234 | 1.8% |
| 90 | 221 | 1.7% |
| 137 | 211 | 1.7% |
| 68 | 206 | 1.6% |
| 85 | 204 | 1.6% |
| 113 | 188 | 1.5% |
| 78 | 178 | 1.4% |
| Other values (5663) | 10366 |
| Value | Count | Frequency (%) |
| 33 | 11 | 0.1% |
| 33.41462783 | 1 | < 0.1% |
| 36.11058286 | 1 | < 0.1% |
| 38.34057846 | 1 | < 0.1% |
| 40.05872905 | 1 | < 0.1% |
| 43.06135588 | 1 | < 0.1% |
| 44 | 35 | |
| 44.78446921 | 1 | < 0.1% |
| 45.36234662 | 1 | < 0.1% |
| 45.81373079 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 219 | 7 | 0.1% |
| 218 | 6 | < 0.1% |
| 216.9723028 | 1 | < 0.1% |
| 215.3684756 | 1 | < 0.1% |
| 214 | 33 | |
| 213.1506125 | 1 | < 0.1% |
| 213 | 17 | |
| 212.788025 | 1 | < 0.1% |
| 211.9013838 | 1 | < 0.1% |
| 210.8061147 | 1 | < 0.1% |
Platelets
Real number (ℝ)
| Distinct | 5821 |
|---|---|
| Distinct (%) | 46.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 262.52557 |
| Minimum | 62 |
|---|---|
| Maximum | 474 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 62 |
|---|---|
| 5-th percentile | 135.52923 |
| Q1 | 213 |
| median | 258.11736 |
| Q3 | 309.9353 |
| 95-th percentile | 427 |
| Maximum | 474 |
| Range | 412 |
| Interquartile range (IQR) | 96.935296 |
Descriptive statistics
| Standard deviation | 80.824691 |
|---|---|
| Coefficient of variation (CV) | 0.3078736 |
| Kurtosis | -0.030523109 |
| Mean | 262.52557 |
| Median Absolute Deviation (MAD) | 49.117358 |
| Skewness | 0.32327866 |
| Sum | 3321211 |
| Variance | 6532.6307 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 445 | 221 | 1.7% |
| 344 | 184 | 1.5% |
| 248 | 166 | 1.3% |
| 228 | 163 | 1.3% |
| 467 | 157 | 1.2% |
| 251 | 151 | 1.2% |
| 238 | 149 | 1.2% |
| 295 | 145 | 1.1% |
| 156 | 138 | 1.1% |
| 224 | 137 | 1.1% |
| Other values (5811) | 11040 |
| Value | Count | Frequency (%) |
| 62 | 3 | < 0.1% |
| 70 | 2 | < 0.1% |
| 71 | 5 | < 0.1% |
| 76 | 1 | < 0.1% |
| 78.31447354 | 1 | < 0.1% |
| 79 | 13 | |
| 80 | 3 | < 0.1% |
| 80.15243046 | 1 | < 0.1% |
| 80.28200419 | 1 | < 0.1% |
| 81 | 7 |
| Value | Count | Frequency (%) |
| 474 | 10 | 0.1% |
| 471.6235288 | 1 | < 0.1% |
| 471 | 8 | 0.1% |
| 467 | 157 | |
| 466.8547027 | 1 | < 0.1% |
| 466.2533335 | 1 | < 0.1% |
| 466.0722616 | 1 | < 0.1% |
| 464.1230986 | 1 | < 0.1% |
| 464 | 1 | < 0.1% |
| 463.761467 | 1 | < 0.1% |
Prothrombin
Real number (ℝ)
| Distinct | 5818 |
|---|---|
| Distinct (%) | 46.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.580938 |
| Minimum | 9 |
|---|---|
| Maximum | 12.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.0 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 9.7 |
| Q1 | 10.095267 |
| median | 10.6 |
| Q3 | 11 |
| 95-th percentile | 11.604313 |
| Maximum | 12.5 |
| Range | 3.5 |
| Interquartile range (IQR) | 0.90473261 |
Descriptive statistics
| Standard deviation | 0.60089942 |
|---|---|
| Coefficient of variation (CV) | 0.056790753 |
| Kurtosis | -0.37264896 |
| Mean | 10.580938 |
| Median Absolute Deviation (MAD) | 0.46457089 |
| Skewness | 0.36708774 |
| Sum | 133859.44 |
| Variance | 0.36108012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10.6 | 931 | 7.4% |
| 10 | 819 | 6.5% |
| 11 | 684 | 5.4% |
| 10.1 | 451 | 3.6% |
| 9.9 | 447 | 3.5% |
| 9.8 | 346 | 2.7% |
| 10.9 | 325 | 2.6% |
| 10.2 | 286 | 2.3% |
| 10.7 | 262 | 2.1% |
| 9.6 | 258 | 2.0% |
| Other values (5808) | 7842 |
| Value | Count | Frequency (%) |
| 9 | 9 | 0.1% |
| 9.044752702 | 1 | < 0.1% |
| 9.1 | 9 | 0.1% |
| 9.2 | 4 | < 0.1% |
| 9.3 | 7 | 0.1% |
| 9.342620491 | 1 | < 0.1% |
| 9.4 | 15 | 0.1% |
| 9.5 | 124 | |
| 9.507821909 | 1 | < 0.1% |
| 9.515172206 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 12.5 | 2 | < 0.1% |
| 12.44917219 | 1 | < 0.1% |
| 12.44359226 | 1 | < 0.1% |
| 12.40018171 | 1 | < 0.1% |
| 12.4 | 19 | |
| 12.39273078 | 1 | < 0.1% |
| 12.38196979 | 1 | < 0.1% |
| 12.37284104 | 1 | < 0.1% |
| 12.37231458 | 1 | < 0.1% |
| 12.37013596 | 1 | < 0.1% |
Stage
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 741.4 KiB |
| 4.0 | |
|---|---|
| 3.0 | |
| 2.0 | |
| 1.0 | 319 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 37953 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 4.0 |
| 4th row | 3.0 |
| 5th row | 4.0 |
Common Values
| Value | Count | Frequency (%) |
| 4.0 | 5336 | |
| 3.0 | 5143 | |
| 2.0 | 1853 | 14.6% |
| 1.0 | 319 | 2.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4.0 | 5336 | |
| 3.0 | 5143 | |
| 2.0 | 1853 | 14.6% |
| 1.0 | 319 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 12651 | |
| 0 | 12651 | |
| 4 | 5336 | |
| 3 | 5143 | |
| 2 | 1853 | 4.9% |
| 1 | 319 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25302 | |
| Other Punctuation | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12651 | |
| 4 | 5336 | |
| 3 | 5143 | |
| 2 | 1853 | 7.3% |
| 1 | 319 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 12651 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 37953 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 12651 | |
| 0 | 12651 | |
| 4 | 5336 | |
| 3 | 5143 | |
| 2 | 1853 | 4.9% |
| 1 | 319 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 12651 | |
| 0 | 12651 | |
| 4 | 5336 | |
| 3 | 5143 | |
| 2 | 1853 | 4.9% |
| 1 | 319 | 0.8% |
Status
Categorical
UNIFORM 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 716.7 KiB |
| 2 | |
|---|---|
| 0 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12651 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 0 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 4217 | |
| 0 | 4217 | |
| 1 | 4217 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 4217 | |
| 0 | 4217 | |
| 1 | 4217 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4217 | |
| 0 | 4217 | |
| 1 | 4217 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4217 | |
| 0 | 4217 | |
| 1 | 4217 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 4217 | |
| 0 | 4217 | |
| 1 | 4217 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4217 | |
| 0 | 4217 | |
| 1 | 4217 |
took_drug
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 716.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12651 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 6431 | |
| 0 | 6220 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 6431 | |
| 0 | 6220 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6431 | |
| 0 | 6220 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6431 | |
| 0 | 6220 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 6431 | |
| 0 | 6220 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 6431 | |
| 0 | 6220 |
Edema_N
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 716.7 KiB |
| 1 | |
|---|---|
| 0 | 396 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12651 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 12255 | |
| 0 | 396 | 3.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 12255 | |
| 0 | 396 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 12255 | |
| 0 | 396 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 12255 | |
| 0 | 396 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 12255 | |
| 0 | 396 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 12255 | |
| 0 | 396 | 3.1% |
Edema_S
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 716.7 KiB |
| 0 | |
|---|---|
| 1 | 251 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12651 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 12400 | |
| 1 | 251 | 2.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 12400 | |
| 1 | 251 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12400 | |
| 1 | 251 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12400 | |
| 1 | 251 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 12400 | |
| 1 | 251 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 12400 | |
| 1 | 251 | 2.0% |
Edema_Y
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 716.7 KiB |
| 0 | |
|---|---|
| 1 | 145 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12651 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 12506 | |
| 1 | 145 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 12506 | |
| 1 | 145 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12506 | |
| 1 | 145 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12651 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12506 | |
| 1 | 145 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12651 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 12506 | |
| 1 | 145 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12651 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 12506 | |
| 1 | 145 | 1.1% |
| Age | Albumin | Alk_Phos | Ascites | Bilirubin | Cholesterol | Copper | Edema_N | Edema_S | Edema_Y | Hepatomegaly | N_years | Platelets | Prothrombin | SGOT | Spiders | Stage | Status | Tryglicerides | is_male | took_drug | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | -0.056 | -0.142 | 0.079 | -0.061 | -0.147 | -0.060 | 0.114 | 0.102 | 0.069 | 0.119 | -0.055 | -0.137 | 0.110 | -0.123 | 0.058 | 0.096 | 0.313 | -0.036 | 0.114 | 0.076 |
| Albumin | -0.056 | 1.000 | -0.155 | 0.218 | -0.236 | -0.022 | -0.191 | 0.126 | 0.047 | 0.144 | 0.214 | 0.193 | 0.113 | -0.087 | -0.171 | 0.123 | 0.135 | 0.158 | -0.033 | 0.049 | 0.119 |
| Alk_Phos | -0.142 | -0.155 | 1.000 | 0.033 | 0.299 | 0.313 | 0.196 | 0.084 | 0.112 | 0.071 | 0.195 | -0.111 | 0.135 | 0.014 | 0.490 | 0.201 | 0.130 | 0.199 | 0.156 | 0.116 | 0.261 |
| Ascites | 0.079 | 0.218 | 0.033 | 1.000 | 0.057 | -0.084 | 0.007 | 0.271 | 0.048 | 0.375 | 0.063 | -0.100 | -0.111 | 0.118 | 0.004 | 0.086 | 0.074 | 0.121 | -0.015 | 0.016 | 0.000 |
| Bilirubin | -0.061 | -0.236 | 0.299 | 0.057 | 1.000 | 0.338 | 0.541 | 0.058 | 0.037 | 0.090 | 0.417 | -0.361 | -0.109 | 0.194 | 0.438 | 0.288 | 0.208 | 0.424 | 0.178 | 0.066 | 0.092 |
| Cholesterol | -0.147 | -0.022 | 0.313 | -0.084 | 0.338 | 1.000 | 0.204 | 0.066 | 0.059 | 0.096 | 0.127 | -0.080 | 0.135 | -0.059 | 0.339 | 0.099 | 0.081 | 0.185 | 0.269 | 0.073 | 0.196 |
| Copper | -0.060 | -0.191 | 0.196 | 0.007 | 0.541 | 0.204 | 1.000 | 0.073 | 0.053 | 0.113 | 0.330 | -0.317 | -0.080 | 0.090 | 0.341 | 0.281 | 0.159 | 0.311 | 0.237 | 0.107 | 0.118 |
| Edema_N | 0.114 | 0.126 | 0.084 | 0.271 | 0.058 | 0.066 | 0.073 | 1.000 | 0.790 | 0.597 | 0.066 | 0.062 | 0.063 | -0.145 | -0.013 | 0.125 | 0.083 | 0.139 | 0.054 | 0.044 | 0.036 |
| Edema_S | 0.102 | 0.047 | 0.112 | 0.048 | 0.037 | 0.059 | 0.053 | 0.790 | 1.000 | 0.009 | 0.026 | -0.039 | 0.002 | 0.076 | 0.008 | 0.045 | 0.037 | 0.073 | -0.034 | 0.047 | 0.027 |
| Edema_Y | 0.069 | 0.144 | 0.071 | 0.375 | 0.090 | 0.096 | 0.113 | 0.597 | 0.009 | 1.000 | 0.072 | -0.050 | -0.106 | 0.138 | 0.012 | 0.143 | 0.090 | 0.142 | -0.044 | 0.000 | 0.021 |
| Hepatomegaly | 0.119 | 0.214 | 0.195 | 0.063 | 0.417 | 0.127 | 0.330 | 0.066 | 0.026 | 0.072 | 1.000 | -0.217 | -0.260 | 0.176 | 0.194 | 0.302 | 0.497 | 0.401 | 0.151 | 0.008 | 0.102 |
| N_years | -0.055 | 0.193 | -0.111 | -0.100 | -0.361 | -0.080 | -0.317 | 0.062 | -0.039 | -0.050 | -0.217 | 1.000 | 0.130 | -0.083 | -0.195 | 0.186 | 0.158 | 0.268 | -0.113 | 0.056 | 0.071 |
| Platelets | -0.137 | 0.113 | 0.135 | -0.111 | -0.109 | 0.135 | -0.080 | 0.063 | 0.002 | -0.106 | -0.260 | 0.130 | 1.000 | -0.205 | -0.039 | 0.239 | 0.124 | 0.214 | 0.054 | 0.079 | 0.081 |
| Prothrombin | 0.110 | -0.087 | 0.014 | 0.118 | 0.194 | -0.059 | 0.090 | -0.145 | 0.076 | 0.138 | 0.176 | -0.083 | -0.205 | 1.000 | 0.079 | 0.214 | 0.127 | 0.273 | -0.110 | 0.061 | 0.060 |
| SGOT | -0.123 | -0.171 | 0.490 | 0.004 | 0.438 | 0.339 | 0.341 | -0.013 | 0.008 | 0.012 | 0.194 | -0.195 | -0.039 | 0.079 | 1.000 | 0.192 | 0.137 | 0.302 | 0.090 | 0.095 | 0.123 |
| Spiders | 0.058 | 0.123 | 0.201 | 0.086 | 0.288 | 0.099 | 0.281 | 0.125 | 0.045 | 0.143 | 0.302 | 0.186 | 0.239 | 0.214 | 0.192 | 1.000 | 0.199 | 0.209 | -0.029 | 0.050 | 0.028 |
| Stage | 0.096 | 0.135 | 0.130 | 0.074 | 0.208 | 0.081 | 0.159 | 0.083 | 0.037 | 0.090 | 0.497 | 0.158 | 0.124 | 0.127 | 0.137 | 0.199 | 1.000 | 0.319 | 0.045 | 0.022 | 0.015 |
| Status | 0.313 | 0.158 | 0.199 | 0.121 | 0.424 | 0.185 | 0.311 | 0.139 | 0.073 | 0.142 | 0.401 | 0.268 | 0.214 | 0.273 | 0.302 | 0.209 | 0.319 | 1.000 | 0.053 | 0.118 | 0.058 |
| Tryglicerides | -0.036 | -0.033 | 0.156 | -0.015 | 0.178 | 0.269 | 0.237 | 0.054 | -0.034 | -0.044 | 0.151 | -0.113 | 0.054 | -0.110 | 0.090 | -0.029 | 0.045 | 0.053 | 1.000 | 0.057 | 0.157 |
| is_male | 0.114 | 0.049 | 0.116 | 0.016 | 0.066 | 0.073 | 0.107 | 0.044 | 0.047 | 0.000 | 0.008 | 0.056 | 0.079 | 0.061 | 0.095 | 0.050 | 0.022 | 0.118 | 0.057 | 1.000 | 0.000 |
| took_drug | 0.076 | 0.119 | 0.261 | 0.000 | 0.092 | 0.196 | 0.118 | 0.036 | 0.027 | 0.021 | 0.102 | 0.071 | 0.081 | 0.060 | 0.123 | 0.028 | 0.015 | 0.058 | 0.157 | 0.000 | 1.000 |
| N_years | Age | is_male | Ascites | Hepatomegaly | Spiders | Bilirubin | Cholesterol | Albumin | Copper | Alk_Phos | SGOT | Tryglicerides | Platelets | Prothrombin | Stage | Status | took_drug | Edema_N | Edema_S | Edema_Y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2.736986 | 58.991781 | 1.0 | 0 | 0 | 0 | 2.3 | 316.0 | 3.35 | 172.0 | 1601.0 | 179.80 | 63.0 | 394.0 | 9.7 | 3.0 | 2 | 1 | 1 | 0 | 0 |
| 1 | 7.052055 | 52.704110 | 0.0 | 0 | 0 | 0 | 0.9 | 364.0 | 3.54 | 63.0 | 1440.0 | 134.85 | 88.0 | 361.0 | 11.0 | 3.0 | 0 | 0 | 1 | 0 | 0 |
| 2 | 9.391781 | 37.608219 | 0.0 | 0 | 1 | 1 | 3.3 | 299.0 | 3.55 | 131.0 | 1029.0 | 119.35 | 50.0 | 199.0 | 11.7 | 4.0 | 2 | 0 | 0 | 0 | 1 |
| 3 | 7.057534 | 50.575342 | 0.0 | 0 | 0 | 0 | 0.6 | 256.0 | 3.50 | 58.0 | 1653.0 | 71.30 | 96.0 | 269.0 | 10.7 | 3.0 | 0 | 0 | 1 | 0 | 0 |
| 4 | 2.158904 | 45.638356 | 0.0 | 0 | 1 | 0 | 1.1 | 346.0 | 3.65 | 63.0 | 1181.0 | 125.55 | 96.0 | 298.0 | 10.6 | 4.0 | 0 | 0 | 1 | 0 | 0 |
| 5 | 3.561644 | 48.501370 | 0.0 | 0 | 0 | 0 | 1.0 | 328.0 | 3.35 | 43.0 | 1677.0 | 137.95 | 90.0 | 291.0 | 9.8 | 3.0 | 0 | 0 | 1 | 0 | 0 |
| 6 | 4.424658 | 58.304110 | 0.0 | 0 | 1 | 0 | 0.6 | 273.0 | 3.94 | 36.0 | 598.0 | 52.70 | 214.0 | 227.0 | 9.9 | 3.0 | 0 | 0 | 1 | 0 | 0 |
| 7 | 5.616438 | 56.668493 | 0.0 | 0 | 0 | 0 | 0.7 | 360.0 | 3.65 | 72.0 | 3196.0 | 94.55 | 154.0 | 269.0 | 9.8 | 2.0 | 0 | 1 | 1 | 0 | 0 |
| 8 | 7.164384 | 41.120548 | 0.0 | 0 | 0 | 0 | 0.9 | 478.0 | 3.60 | 39.0 | 1758.0 | 171.00 | 140.0 | 234.0 | 10.6 | 2.0 | 0 | 1 | 1 | 0 | 0 |
| 9 | 9.810959 | 70.608219 | 0.0 | 0 | 0 | 0 | 0.5 | 252.0 | 3.60 | 26.0 | 377.0 | 56.76 | 185.0 | 336.0 | 10.0 | 2.0 | 0 | 0 | 1 | 0 | 0 |
| N_years | Age | is_male | Ascites | Hepatomegaly | Spiders | Bilirubin | Cholesterol | Albumin | Copper | Alk_Phos | SGOT | Tryglicerides | Platelets | Prothrombin | Stage | Status | took_drug | Edema_N | Edema_S | Edema_Y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 12641 | 4.074124 | 50.190912 | 0.0 | 0 | 1 | 1 | 2.688480 | 224.460794 | 3.578298 | 124.923201 | 1045.407357 | 120.291904 | 104.430720 | 182.592643 | 11.039232 | 4.0 | 2 | 0 | 1 | 0 | 0 |
| 12642 | 6.030967 | 58.486872 | 0.0 | 0 | 1 | 0 | 0.722277 | 258.000000 | 3.914030 | 49.000000 | 559.000000 | 43.400000 | 148.639637 | 279.488124 | 11.080098 | 4.0 | 2 | 0 | 1 | 0 | 0 |
| 12643 | 4.345443 | 47.416792 | 0.0 | 0 | 1 | 1 | 1.653824 | 327.153596 | 3.638445 | 129.537587 | 932.231694 | 150.324485 | 159.770580 | 337.076473 | 10.846144 | 4.0 | 2 | 1 | 1 | 0 | 0 |
| 12644 | 7.292234 | 56.144677 | 0.0 | 0 | 1 | 1 | 1.613946 | 248.845699 | 3.637863 | 46.382796 | 1105.347206 | 102.157257 | 72.124623 | 146.264085 | 11.067656 | 4.0 | 2 | 1 | 1 | 0 | 0 |
| 12645 | 2.476712 | 63.329379 | 0.0 | 0 | 1 | 0 | 3.596415 | 396.000000 | 3.192760 | 58.000000 | 1440.000000 | 153.450000 | 131.000000 | 163.451625 | 10.137993 | 4.0 | 2 | 1 | 1 | 0 | 0 |
| 12646 | 2.274149 | 60.840896 | 0.0 | 0 | 1 | 1 | 2.905185 | 394.131438 | 3.297450 | 158.250991 | 1820.828596 | 132.069302 | 133.932300 | 202.733161 | 11.156572 | 4.0 | 2 | 1 | 1 | 0 | 0 |
| 12647 | 6.008568 | 48.824212 | 0.0 | 0 | 1 | 0 | 3.478799 | 444.275612 | 3.614099 | 45.484101 | 2045.000000 | 89.900000 | 111.332158 | 225.000000 | 9.712014 | 4.0 | 2 | 1 | 1 | 0 | 0 |
| 12648 | 6.773519 | 51.713110 | 0.0 | 0 | 1 | 1 | 2.157310 | 261.895800 | 3.521765 | 123.416801 | 792.250402 | 91.434347 | 91.521001 | 179.905898 | 11.052100 | 4.0 | 2 | 0 | 1 | 0 | 0 |
| 12649 | 7.724548 | 56.803764 | 0.0 | 0 | 1 | 1 | 2.000000 | 267.000000 | 3.078395 | 89.000000 | 754.000000 | 195.286150 | 90.000000 | 140.565985 | 11.800000 | 4.0 | 2 | 0 | 1 | 0 | 0 |
| 12650 | 3.438130 | 48.262560 | 0.0 | 0 | 1 | 1 | 3.071357 | 267.281416 | 3.574505 | 97.276402 | 1160.278073 | 110.050000 | 81.994986 | 200.000000 | 9.828476 | 4.0 | 2 | 1 | 1 | 0 | 0 |
Most frequently occurring
| N_years | Age | is_male | Ascites | Hepatomegaly | Spiders | Bilirubin | Cholesterol | Albumin | Copper | Alk_Phos | SGOT | Tryglicerides | Platelets | Prothrombin | Stage | Status | took_drug | Edema_N | Edema_S | Edema_Y | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17 | 6.761644 | 47.213699 | 0.0 | 0 | 1 | 0 | 1.3 | 316.0 | 3.51 | 75.0 | 1162.0 | 147.25 | 137.0 | 238.0 | 10.0 | 3.0 | 1 | 0 | 1 | 0 | 0 | 23 |
| 13 | 6.139726 | 40.287671 | 0.0 | 0 | 0 | 0 | 0.5 | 201.0 | 3.73 | 44.0 | 1345.0 | 54.25 | 145.0 | 445.0 | 10.1 | 3.0 | 1 | 0 | 1 | 0 | 0 | 17 |
| 9 | 4.120548 | 38.131507 | 0.0 | 0 | 1 | 1 | 3.4 | 279.0 | 3.53 | 143.0 | 671.0 | 113.15 | 72.0 | 136.0 | 10.9 | 3.0 | 1 | 1 | 1 | 0 | 0 | 8 |
| 12 | 6.139726 | 40.287671 | 0.0 | 0 | 0 | 0 | 0.5 | 201.0 | 3.73 | 44.0 | 1345.0 | 54.25 | 145.0 | 445.0 | 10.1 | 2.0 | 1 | 0 | 1 | 0 | 0 | 6 |
| 14 | 6.438356 | 41.180822 | 0.0 | 0 | 0 | 0 | 5.5 | 528.0 | 4.18 | 77.0 | 2404.0 | 172.05 | 78.0 | 467.0 | 10.7 | 3.0 | 1 | 1 | 1 | 0 | 0 | 6 |
| 18 | 6.761644 | 47.213699 | 0.0 | 0 | 1 | 0 | 1.3 | 316.0 | 3.51 | 75.0 | 1162.0 | 147.25 | 137.0 | 238.0 | 10.0 | 4.0 | 1 | 0 | 1 | 0 | 0 | 6 |
| 19 | 6.780822 | 36.517808 | 0.0 | 0 | 0 | 0 | 3.4 | 450.0 | 3.37 | 32.0 | 1408.0 | 116.25 | 118.0 | 313.0 | 11.2 | 3.0 | 1 | 1 | 1 | 0 | 0 | 6 |
| 3 | 2.468493 | 40.928767 | 0.0 | 0 | 1 | 0 | 3.2 | 339.0 | 3.18 | 123.0 | 3336.0 | 205.00 | 84.0 | 304.0 | 9.9 | 4.0 | 1 | 1 | 1 | 0 | 0 | 4 |
| 4 | 2.476712 | 61.336986 | 0.0 | 0 | 1 | 0 | 3.9 | 396.0 | 3.20 | 58.0 | 1440.0 | 153.45 | 131.0 | 156.0 | 10.0 | 4.0 | 2 | 1 | 1 | 0 | 0 | 4 |
| 1 | 1.460274 | 56.024658 | 0.0 | 0 | 1 | 1 | 1.2 | 275.0 | 3.43 | 100.0 | 1142.0 | 75.00 | 91.0 | 217.0 | 11.3 | 3.0 | 1 | 1 | 1 | 0 | 0 | 3 |